1. INTRODUCTION

A landslide is a geological phenomenon involving the movement of debris, soil or a mass of rock down
a slope [1]. The downhill movement of soil under the influence of gravity, precipitation and slope incline is
also termed a landslide [2]. It essentially involves the displacement of ground and rock due to soil erosion.
Landslides are recognised as significant geological hazards and a natural source of devastation around the
world: they damage infrastructure and property and claim human lives. Researchers have studied many
landslide-prone areas around the world. These studies focus mainly on landslide inventories, on the impact
of geological, topographical, hydrological and anthropogenic factors on landslide occurrence, and on the role
of triggering factors such as rainfall and other calamities. The causative factors of landslide occurrence are
divided into categories such as geomorphology, geology, meteorology, soil, land cover and hydrologic
conditions [2-3].

Landslides occur frequently in Malaysia because of heavy rainfall throughout the year. Most
landslides happen during the annual monsoons, i.e. the Southwest monsoon (May to September) and the
Northeast monsoon (November to March). Landslides caused considerable damage in Malaysia from 2000 to
2009 [4-5]. Many studies and methods have been proposed to reduce the damage caused by landslides [6-7].
This is accomplished by identifying the risky areas, which acts as a warning so that proper measures can be
taken [4], [8-9]. Because it is not possible to predict a landslide occurrence in time and space, a region is
instead classified into different categories of potential hazard [2].

GIS and various evaluation techniques have been used for evaluating landslide hazards. The most
attractive methods for computing landslide hazard maps are bivariate statistical methods such as
frequency ratio, statistical index and information value [6], [10-14]. Although advanced computational tools
such as artificial neural networks and decision trees have been applied successfully in generating landslide
hazard maps, these techniques usually require longer processing time than bivariate statistical
methods [15-18]. Long computation time is also a challenge in real-time online systems: an up-to-date
landslide map provides crucial information for a warning system and must be updated in a short time.
To overcome the computation time issue, bivariate statistical methods such as Frequency Ratio (FR) and
Information Value (IV) are adopted in this paper for their easy implementation and their speed in producing
hazard maps with good accuracy [19]. This paper presents the analysis of landslide hazard mapping of
Penang Island using bivariate methods, i.e. Frequency Ratio (FR), Information Value (IV) and Modified
Information Value (MIV). These bivariate statistical approaches are computed using MATLAB software.


2. STUDY AREA 

Because landslide events occur frequently in Penang, Malaysia, Penang Island is selected as
the study area in this paper. Penang is situated in the northwest of Peninsular Malaysia. It is bounded by
the state of Kedah (north and east), the state of Perak (south), and the Strait of Malacca and Sumatra,
Indonesia (west). Penang consists of the island as well as a coastal strip on the mainland, which is also
known as Province Wellesley. In this research, the landslide hazard analysis covers the island of Penang.
The area of Penang Island is 285 km². It is located between latitudes 5°15'N and 5°30'N and longitudes
100°10'E and 100°20'E. The land of Penang Island consists mainly of plants, forest, green land, swamp and
urban areas. The slope of the terrain ranges from 0° to 87° and its elevation from 0 to 820 m above sea
level. The temperature of Penang Island ranges from 29°C to 32°C. The average annual precipitation
lies between 2254 mm and 2903 mm. The major types of vegetation on Penang Island are forest and fruit
plantation. The island is also shaped by fault lines, which run mainly from north to south through the
center of the island. Figure 1 shows the map of Penang Island.





Figure 1. Penang Island (Google map) 


3. DATA SET 

Remote sensing and a Geographical Information System (GIS) are used to collect the geological
database of Penang Island. Numerous images of Penang Island are acquired from several government
agencies in Malaysia, such as the Malaysian Meteorological Department, the Department of Irrigation and
Drainage (DID), the Department of Agriculture (DOA) and the Department of Minerals and Geo-Science (JMG).
The landslide occurrence points of the entire island are also collected and converted into spatial datasets
with a resolution of 5 m, which are used in this study. The hilly terrain in the central part of the island is
the area where landslides occur most often. The landslides mainly involve shallow rotational debris slides,
debris flows and rock falls.

To understand the main causes of landsliding, the relationship between causative factors and slope
failure should be established and areas of instability identified and mapped [20]. Before the probabilistic
approaches are applied, the data on landslide causative parameters are gathered and transformed into
spatial datasets. Twelve (12) landslide causative parameters are taken into account in computing the
Frequency Ratio (FR), Information Value (IV) and Modified Information Value (MIV) of landslides. A digital
elevation model (DEM) with a resolution of 5 m is developed from the topographic database. Elevation is
obtained from the DEM and then used to calculate the slope gradient angle, slope aspect and slope curvature.
The road map and drainage map are used to find the distance from roads and from drainage respectively.
The buffer zones for line features are prepared at 50 m intervals [21-22]. The distances from fault lines are
divided into 100 m intervals. The soil texture map of the island comprises six types of soil texture, and the
geology map covers six types of granite found on Penang Island. The natural triggering factor of landslides
on the island is precipitation: rainfall saturates the soil and washes away debris and rocks.
The inverse distance weighting interpolation method is used to produce the precipitation map because of
the limited number of rain measurement stations on the island. The landslide causative parameter maps,
shown in Figure 2, are obtained using the ArcGIS software.
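
The inverse distance weighting step mentioned above can be illustrated with a minimal Python sketch. It is not the ArcGIS procedure used in the study; the function name `idw_interpolate`, the power of 2 and the station coordinates and rainfall values are all hypothetical and chosen only for illustration.

```python
import numpy as np

def idw_interpolate(stations_xy, station_values, grid_xy, power=2.0, eps=1e-12):
    """Estimate values at grid_xy as a distance-weighted mean of station values.

    power and eps are illustrative assumptions, not settings from the paper.
    """
    stations_xy = np.asarray(stations_xy, dtype=float)
    station_values = np.asarray(station_values, dtype=float)
    grid_xy = np.asarray(grid_xy, dtype=float)
    # Pairwise distances between query points and stations: (n_grid, n_stations)
    d = np.linalg.norm(grid_xy[:, None, :] - stations_xy[None, :, :], axis=2)
    # Closer stations get larger weights; eps avoids division by zero at a station
    w = 1.0 / np.maximum(d, eps) ** power
    return (w * station_values).sum(axis=1) / w.sum(axis=1)

# Hypothetical example: two rain stations 10 km apart (annual rainfall in mm)
stations = [[0.0, 0.0], [10.0, 0.0]]
rain_mm = [2000.0, 3000.0]
print(idw_interpolate(stations, rain_mm, [[5.0, 0.0], [0.0, 0.0]]))
```

A query point midway between the two stations receives the mean of their values, and a query point on a station reproduces that station's value, which is the behaviour expected of this interpolator.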

Figure 2. The maps of landslide causative parameters


4. METHODOLOGY

Bivariate statistical methods are used to produce the landslide hazard maps. These methods have
several advantages: they give efficient and accurate results at low cost, are straightforward to execute,
and give good insight into the relations between landslides and landslide-causing factors. The weight of
each factor's data is computed using these methods; it depends on class allocation and landslide
intensity [23]. These weights represent the contribution of each factor to landslide occurrence. The
landslide hazard index (LHI) is then produced from the factor weights.

The principal bivariate statistical method is the Frequency Ratio (FR) approach. It is based on
the observed relationships between the landslide distribution and the landslide contributing factors. The
weight of each class is obtained by dividing the landslide density of the class by the overall landslide
density. If FR is greater than 1, the class has a higher correlation with landslide occurrence, and vice
versa. The FR values of every pixel of the map are summed to calculate the LHI, so areas with higher FRs
are more prone to landslides. FR and LHI are calculated by (1) and (2) respectively.

$$ FR_i = \frac{N_{pix}(S_i) / N_{pix}(N_i)}{\sum N_{pix}(S_i) / \sum N_{pix}(N_i)} \qquad (1) $$

$$ LHI_{FR} = FR_{total} = \sum_{i=1}^{n} FR_i \qquad (2) $$

where
$N_{pix}(S_i)$: total number of pixels with landslide in class $i$,
$N_{pix}(N_i)$: total number of pixels of class $i$ in the study area,
$\sum N_{pix}(S_i)$: total number of pixels containing landslide,
$\sum N_{pix}(N_i)$: total number of pixels in the study area,
$n$: number of classes of the causative factors.
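
As a rough illustration of (1) and (2), the following Python sketch computes the frequency ratio of each class of one factor map and sums the per-pixel weights into a hazard index. The paper's own computation was done in MATLAB; the function names, the integer class coding (0 to n-1) and the tiny example rasters are assumptions made here for illustration, and the sketch assumes at least one landslide pixel exists.

```python
import numpy as np

def frequency_ratio(class_map, landslide_mask, n_classes):
    """FR per class: class landslide density divided by overall landslide density."""
    class_map = np.asarray(class_map).ravel()
    landslide_mask = np.asarray(landslide_mask, dtype=bool).ravel()
    # Pixels per class, and landslide pixels per class
    n_class = np.bincount(class_map, minlength=n_classes).astype(float)
    n_slide = np.bincount(class_map[landslide_mask], minlength=n_classes).astype(float)
    # Landslide density of each class (0 for classes absent from the map)
    density = np.divide(n_slide, n_class, out=np.zeros(n_classes), where=n_class > 0)
    overall = landslide_mask.sum() / class_map.size
    return density / overall

def hazard_index(class_maps, weights_per_factor):
    """Sum, pixel by pixel, the class weight of every factor map."""
    lhi = np.zeros(np.asarray(class_maps[0]).shape, dtype=float)
    for cmap, weights in zip(class_maps, weights_per_factor):
        lhi += np.asarray(weights)[np.asarray(cmap)]
    return lhi

# Hypothetical 2 x 3 factor map with three classes and three landslide pixels
slope_class = np.array([[0, 0, 1], [1, 2, 2]])
landslides = np.array([[0, 1, 0], [0, 1, 1]], dtype=bool)
fr = frequency_ratio(slope_class, landslides, 3)
print(fr)                                    # class weights: 1.0, 0.0, 2.0
print(hazard_index([slope_class], [fr]))
```

In this toy case class 2 holds landslides at twice the overall density (FR = 2), class 0 matches the overall density (FR = 1) and class 1 has none (FR = 0).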


Another bivariate statistical model used in this study is the Information Value (IV) model.
It computes the information value of each class of every landslide causative factor. Equation (3) gives
the information value for each class of the factors [24-25].


$$ IV_i = \ln\left( \frac{N_{pix}(S_i) / N_{pix}(N_i)}{\sum N_{pix}(S_i) / \sum N_{pix}(N_i)} \right) \qquad (3) $$

The total information value for all pixels is computed using (4) as:

$$ LHI_{IV} = IV_{total} = \sum_{i=1}^{n} IV_i \qquad (4) $$
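
A minimal Python sketch of (3) is given here, working directly from per-class pixel counts. The function name and the example counts are hypothetical, and every class is assumed to contain at least one pixel. Because the natural logarithm of zero is undefined, the sketch marks classes without landslide pixels as NaN and leaves any substitution to the caller.

```python
import numpy as np

def information_value(n_slide, n_class):
    """IV per class from landslide-pixel counts and total-pixel counts per class."""
    n_slide = np.asarray(n_slide, dtype=float)
    n_class = np.asarray(n_class, dtype=float)
    # Class landslide density relative to the overall landslide density
    ratio = (n_slide / n_class) / (n_slide.sum() / n_class.sum())
    iv = np.full(ratio.shape, np.nan)
    has_slides = ratio > 0
    iv[has_slides] = np.log(ratio[has_slides])  # natural logarithm, as in (3)
    return iv

# Hypothetical counts for three classes of one factor
print(information_value([1, 0, 2], [2, 2, 2]))
```

For these counts the first class matches the overall density (IV = 0), the second has no landslides (NaN) and the third has twice the overall density (IV = ln 2).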
From (3), a higher positive value of $IV_i$ indicates more landslide occurrences in the class, and
vice versa. When a class has no landslide, the argument of the logarithm is zero and $IV_i$ is undefined,
so $IV_i$ is replaced by the minimum of the minimum values of each class. To overcome this issue, a new
method is proposed. The model is known as the Modified Information Value (MIV) model and is computed
using (5).


$$ MIV_i = \log_2\left( \frac{N_{pix}(S_i) / N_{pix}(N_i)}{\sum N_{pix}(S_i) / \sum N_{pix}(N_i)} + 1 \right) \qquad (5) $$

$$ LHI_{MIV} = MIV_{total} = \sum_{i=1}^{n} MIV_i \qquad (6) $$


Three main conditions are observed from (5). First, if $MIV_i = 0$, no landslide exists in the
class. Second, if $MIV_i = 1$, the proportion of landslide pixels in class $i$ is equal to that of the
whole study area. Third, if $MIV_i > 1$, class $i$ contains proportionally more landslide pixels than the
whole study area. The higher the MIV, the greater the proportion of landslides in the class, and vice
versa.
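
The three conditions can be checked numerically with a short Python sketch. It assumes that the MIV takes the form log2(ratio + 1), where the ratio is the class landslide density divided by the overall landslide density; this is the logarithmic form consistent with the three conditions, but it should be treated as an assumption of the sketch. The function name and the example counts are hypothetical.

```python
import numpy as np

def modified_information_value(n_slide, n_class):
    """MIV per class, assuming MIV = log2(density ratio + 1)."""
    n_slide = np.asarray(n_slide, dtype=float)
    n_class = np.asarray(n_class, dtype=float)
    # Class landslide density relative to the overall landslide density
    ratio = (n_slide / n_class) / (n_slide.sum() / n_class.sum())
    # Adding 1 keeps the logarithm defined for classes with no landslide
    return np.log2(ratio + 1.0)

# Hypothetical counts: equal density, no landslides, twice the overall density
print(modified_information_value([1, 0, 2], [2, 2, 2]))
```

With these counts the first class gives exactly 1, the second exactly 0 and the third a value above 1, so no special substitution is needed for classes without landslides.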


5. RESULTS AND ANALYSIS 

In this study, the accuracy of the landslide hazard maps is verified using the area under the curve
(AUC) of the Receiver Operating Characteristic (ROC) method. Two assumptions are made in validating the
maps. First, landslides are linked to spatial data such as topographic height and slope gradient. Second,
triggering factors such as rainfall affect future landslides [26].

The first step in verification by the ROC method is to arrange the LHI values in descending order.
These indexes are then divided into one hundred (100) subclasses on the y-axis with 1% intervals on the
x-axis [4]. The resulting curve illustrates how well the approaches and factors predict upcoming landslides.
The accuracy of each approach is given by the area under the curve [26-27]. The accuracies obtained from
the ROC curves shown in Figure 3 are 79.58% for FR, 79.14% for IV and 79.37% for MIV. FR is therefore the
most accurate of the three methods, although only by 0.44 percentage points over IV, and the MIV model
comes closer to the accuracy of FR than the IV model does. Landslide hazard maps are computed from the
LHI obtained from the three bivariate statistical methods. The LHI is classified into four categories:
a) Highly hazardous for the maximum 10% of LHI (90-100%)
b) Hazardous for the following 10% of LHI (80-90%)
c) Moderately hazardous for the next 20% of LHI (60-80%)
d) Non-hazardous for the remaining 60% of LHI (0-60%)

Figure 3. ROC curves of FR, IV and MIV methods
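
The verification procedure can be sketched in Python as follows. The sketch assumes that the x-axis is the share of the study area ranked from highest to lowest LHI in 1% steps, that the y-axis is the cumulative share of landslide pixels captured, and that the area is obtained with the trapezoidal rule. The function name and the toy data are hypothetical, and the raster is assumed to hold at least as many pixels as bins.

```python
import numpy as np

def success_rate_auc(lhi, landslide_mask, n_bins=100):
    """Cumulative landslide share versus ranked-area share, and the area under it."""
    lhi = np.asarray(lhi, dtype=float).ravel()
    slides = np.asarray(landslide_mask, dtype=bool).ravel()
    # Rank pixels from highest to lowest hazard index (ties keep input order)
    order = np.argsort(-lhi, kind="stable")
    cum_slides = np.cumsum(slides[order]) / slides.sum()
    # Sample the cumulative curve at equal steps of ranked area
    edges = np.linspace(0, lhi.size, n_bins + 1).round().astype(int)
    y = np.concatenate(([0.0], cum_slides[edges[1:] - 1]))
    x = np.linspace(0.0, 1.0, n_bins + 1)
    # Trapezoidal rule for the area under the curve
    area = np.sum((y[1:] + y[:-1]) / 2.0 * np.diff(x))
    return x, y, area

# Toy example: 10 pixels, the two landslides fall on the two highest indexes
lhi = np.arange(9, -1, -1)
slides = np.array([1, 1, 0, 0, 0, 0, 0, 0, 0, 0], dtype=bool)
print(success_rate_auc(lhi, slides, n_bins=10)[2])
```

A curve that rises quickly, as in this toy case, means that most landslides fall in the highest-ranked part of the map, which is what a useful hazard index should achieve.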


Landslide hazard maps of Penang Island generated using FR, IV and MIV are shown in Figure 4 (a),
(b) and (c) respectively. In these figures, red indicates highly hazardous areas, green indicates
hazardous areas, blue indicates moderately hazardous areas and white indicates non-hazardous areas.
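
The four hazard categories defined in this section can be assigned with a short Python sketch. It assumes that the percentages refer to the rank of each pixel's LHI within the whole map, so that the top 10% of ranked pixels are highly hazardous; the function name and the integer category codes are hypothetical.

```python
import numpy as np

def classify_hazard(lhi):
    """Return 0 = non-hazardous, 1 = moderately hazardous, 2 = hazardous,
    3 = highly hazardous, based on the percentile rank of each pixel's LHI."""
    lhi = np.asarray(lhi, dtype=float)
    # Rank of every pixel (0 = lowest LHI), converted to a percentile in (0, 100]
    ranks = lhi.ravel().argsort(kind="stable").argsort(kind="stable")
    pct = 100.0 * (ranks + 1) / lhi.size
    # Breaks at 60 %, 80 % and 90 % follow the four categories listed in the text
    classes = np.digitize(pct, [60.0, 80.0, 90.0], right=True)
    return classes.reshape(lhi.shape)

# Toy example with ten pixels of increasing hazard index
print(classify_hazard(np.arange(10.0)))
```

For ten pixels of increasing LHI this gives six non-hazardous pixels, two moderately hazardous, one hazardous and one highly hazardous, matching the 60/20/10/10 split.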







Figure 4. Landslide hazard maps (a) FR (b) IV (c) MIV 


6. CONCLUSION 

In Malaysia, landslides have caused many casualties and much damage. They are among the most severe
natural disasters and generally originate under complex environmental conditions. Nevertheless, much
construction and development has been carried out in the hilly regions of Penang Island because flat
terrain is limited. A landslide alerting scheme is therefore required to warn people about the occurrence
of landslides. In this paper, landslide hazard mapping of Penang Island is analyzed using bivariate
statistical approaches, which are simple methods that take short computation time and produce results with
good accuracy. The FR, IV and MIV models are used to compute the landslide hazard maps of Penang Island.
The accuracies of the FR (79.58%), IV (79.14%) and MIV (79.37%) models are calculated from the ROC curves.
The bivariate methods applied in this paper, i.e. frequency ratio, information value and modified
information value, provide good results in predicting landslide events.